Description: 网页抓取器又叫网络机器人(Robot)、网络爬行者、网络蜘蛛。网络机器人(Web Robot),也称网络蜘蛛(Spider),漫游者(Wanderer)和爬虫(Crawler),是指某个能以人类无法达到的速度不断重复执行某项任务的自动程序。他们能自动漫游与Web站点,在Web上按某种策略自动进行远程数据的检索和获取,并产生本地索引,产生本地数据库,提供查询接口,共搜索引擎调用。-web crawling robots - known network (Robot), Web crawling, spider network. Network Robot (Web Robot), also called network spider (Spider), rovers (Wanderer) and reptiles (Crawler), is a human can not reach the speed of repeated execution of a mandate automatic procedures. They can automatically roaming and Web site on the Web strategy by some automatic remote data access and retrieval, Index and produce local, have local database, which provides interfaces for a total of search engine called. Platform: |
Size: 20429 |
Author:shengping |
Hits:
Description: Lucene is an Open Source, mature and high-performance Java search engine. It is highly flexible, and scalable from hundreds to millions of documents.
Luke is a handy development and diagnostic tool, which accesses already existing Lucene indexes and allows you to display and modify their contents in several ways:
browse by document number, or by term
view documents / copy to clipboard
retrieve a ranked list of most frequent terms
execute a search, and browse the results
analyze search results
selectively delete documents from the index
reconstruct the original document fields, edit them and re-insert to the index
optimize indexes
and much more...
Platform: |
Size: 1163146 |
Author:童小军 |
Hits:
Description: < 网络机器人java编程指南>>的配套源程序,研究如何实现具有Web访问能力的网络机器人的书。从Internet编程的基本原理出发,深入浅出、循序渐进地阐述了网络机器人程序Spider、Bot、Aggregator的实现技术,并分析了每种程序的优点及适用场合。本书提供了大量的有效源代码,并对这些代码进行了详细的分析。通过本书的介绍,你可以很方便地利用这些技术,设计并实现网络蜘蛛或网络信息搜索器等机器人程序。-lt; Lt; Java Web Robot Programming Guide gt; Gt; The matching source to study how to achieve the ability to visit the Web network robot book. Internet programming from the basic principles and simple, step by step elaborate procedures of the network robot Spider, Bot, aggregator of technology, and analysis of the merits of each procedure and applicable occasions. The book provides plenty of source code, as well as code for a detailed analysis. Through the book, you can easily take advantage of these technologies, network design and network information spider or other search engine robot procedures. Platform: |
Size: 1437696 |
Author:黄 |
Hits:
Description: 搜索引擎,一个开源项目,不错的,功能很全-search engine, an open-source project, it is true, is the entire function Platform: |
Size: 8275968 |
Author:周 |
Hits:
Description: 网格搜索引擎技术研究:一篇论文。网格搜索引擎对于当前热门的网格技术(grid)来说是一个空缺,值得研究。-grid search engine technology : a thesis. Grid search engine for the current hot Grid (grid) is a vacancy, it is worth studying. Platform: |
Size: 219136 |
Author:周 |
Hits:
Description: 全文检索者站内搜索引擎论坛免费版1.0正式版for leadbbs314,dvbbs7 sp2 access数据库-were retrieved by the search engine stations Forum free version of the final version 1.0 for leadbbs314, dvbbs7 sp2 database access Platform: |
Size: 519168 |
Author:s |
Hits:
Description: Lucene Web interface, use XML as a lightweight protocol. developer can convert data source (text, DB, MS Word, PDF... etc) into xml format, indexing with lucene engine, and get full text search result via HTTP, with XML format output, user can easily intergrated with JSP ASP PHP front end or use XSLT at server side transform output.
Platform: |
Size: 2890752 |
Author:张和 |
Hits:
Description: 一个由java实现的搜索引擎代码。实现对网页内容的分析和采集功能-a realization by the search engine code. Achieving the right Web content collection and analysis functions Platform: |
Size: 1179648 |
Author:杜永鑫 |
Hits:
Description: 一个小的搜索引擎,仿google搜索引擎开发的小程序。仅供个人学习使用-a small search engine Google search engine imitation of small procedures. Only individual learning Platform: |
Size: 1186816 |
Author:王小雨 |
Hits:
Description: soo search是一个服务的接口,目标为简化搜索引擎的定制规则,加速全文索引的快速高效的开发。通过javaBean技术,把资源对象化,以建立方便的资源管理机制。soosoo search把资源的输入和输出通过一个值对象(bean)和用户进行交互,这样soosoo search可以快速的和现有的j2ee开源框架进行集成。soosoo search提供了两个接口,一个是索引器接口,一个是检索器接口。而这里两个接口的实现都是通过公共的数据模板进行资源的格式化。利用用户定制的javaBean对象,把外部需要建立索引的资源进行格式化。同时用户可以通过这个javaBean对象封装数据的检索结果。-soo search is a service interface, simplifying the objectives of the Custom search engine rules, the index accelerated by the rapid and efficient development. JavaBean through technology, object-oriented resources, to facilitate the establishment of the resource management mechanisms. Soosoo search resource inputs and outputs through a target value (bean) and user interaction, such soosoo search quickly and creates the framework for integration revenue. Soosoo search for the two interfaces, one is indexing interface, is a search interface. And here the two interfaces are achieved through public data templates for formatting resources. Using customized javaBean object, the need for external resources index format. While users can target the javaBean Packaging data retrieval results. Platform: |
Size: 1242112 |
Author:张林 |
Hits:
Description: 一个用java程序。java语言编制的搜索引擎界面。跟baidu的差不多-use a java procedures. Java language search engine interface. With the same engines. Platform: |
Size: 2048 |
Author:晃晃 |
Hits:
Description: 这是一个简单的综合搜索引擎!初学者可以根据这些写法学习jsp-This is a simple, comprehensive search engine! Beginners can learn from these written jsp Platform: |
Size: 37888 |
Author:三水 |
Hits: